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PROBLEM 


Obtain  optimum  realizable  automatic  detection  methods  for  future  Fleet  use. 
Specifically,  develop  distribution-free  radar  detectors  for  antijamming  properties.  For 
this  purpose,  define  and  investigate  the  properties  of  a  statistic  which  is  used  to  deter¬ 
mine  whether  a  sample  has  extreme  values.  Investigate  whether  this  statistic  has  the 
property  of  achieving  a  preassigned  probability  of  falsely  rejecting  the  null  hypothesis. 
Compare  the  properties  of  this  new  statistic  to  those  of  the  well  known  Mann-Whitney- 
Wilcoxon  {/-statistic  and  suggest  some  radar  applications. 


RESULTS 


1 A  distribution-free  detector  (statistical  test)  is  defined.  Detection  is  based  upon 
the  sensing  of  extreme  values  of  the  radar  signal. 

2.  The  test  is  compared  to  the  Mann-Whitney-Wilcoxon  test.  It  is  found  that  the 
new  detector,  which  is  distribution-free  with  respect  to  the  class  of  distributions  that 
are  symmetric  about  zero,  can  achieve  closely  a  preassigned  probability  of  falsely  reject¬ 
ing  the  null  hypothesis  when  it  is  true  provided  the  size  of  the  sample  is  large  enough. 
Also,  if  the  sizes  of  two  independent  samples  are  sufficiently  large,  the  detection  proce¬ 
dure  using  the  Mann-Whitney-Wilcoxon  {/-statistic  yields  the  same  probability  results  as 
those  of  the  new  detector  using  one  sample. 

3.  The  detection  procedure  seems  to  be  poor  in  one  radar  application  (against  single 
targe's)  but  relatively  good  in  another  (against  multiple  targets). 
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INTRODUCTION 


In  most  of  the  signal  detection  problems  treated  in  the  literature,  the  physical  and 
statistical  characteristics  of  the  signal  and  noise  are  well  enough  known  that  the  func¬ 
tional  forms  of  their  distributions  can  be  stated  and  parameters  of  the  distributions  can  be 
specified.  Frequently,  though,  it  is  possible  only  to  make  general  assumptions  concern¬ 
ing  the  forms  of  the  distribution  functions  of  noise  and  signal.  We  must  then  find  a 
statistical  testing  procedure  which  can  distinguish  between  the  signal-present  case  and 
the  noise-only  case  on  the  basis  of  that  small  amount  of  information. 


Scope  of  Report 

In  this  report  we  discuss  the  problem  of  signal  detection  in  a  sample  of  size  n 
under  the  assumption  that  signals  arise  from  a  stochastically  larger  population.  Such  a 
problem  occurs  in  the  radar  ECM  environment.  Hie  time  element  in  signal  detection 
makes  the  problem  relevant.  More  specifically,  the  equivalent  statistical  problem  we 
wish  to  discuss  is  that  of  deciding  whether  extreme  values  in  a  sample  come  from  the 
same  distribution  as  the  main  body  of  the  sample. 

In  general,  a  statistical  decision  procedure  has  the  property  that  the  error  of  reject¬ 
ing  the  null  hypothesis  when  it  is  true  (false-alarm  probability)  can  be  controlled.  For 
our  consideration  this  means  that  we  wish  to  control  the  error  of  deciding  that  extreme 
values  are  present  when  actually  no  extreme  values  are  present. 

We  describe  a  distribution-free  procedure  for  determining  the  presence  of  extreme 
values  on  the  basis  of  a  single  sample  of  size  n,  such  as  u«ta  from  a  multiple-range-bin 
radar  on  a  single  pulse.  A  statistic  S,  based  on  a  sample  of  size  n,  is  defined  and  its  prop¬ 
erties  are  investigated,  especially  the  property  of  controlling  the  error  of  incorrectly 
rejecting  the  null  hypothesis. 

It  is  shown  that  the  statistic  S  is  distribution-free  over  the  class  of  cumulative  con¬ 
tinuous  distribution  functions  which  are  symmetric  about  zero.  A  comparison  of  per¬ 
formance  is  made  with  the  well  known  Mann-Whitney-Wilcoxon  (7-statistic.  Some 
applications  are  presented. 


Preliminaries 

Any  distribution-free  decision  that  a  value  is  extreme  must  be  based  only  on  com¬ 
parison  with  other  sample  values.  For  example,  we  compare  the  k  largest  values  or  the 
q  smallest  values  with  the  remaining  values  and  on  the  basis  of  this  comparison  make  a 
decision  as  to  whether  or  not  extreme  values  are  present  in  the  sample.  The  source  of 

our  data  is  a  sample  of  size  n  (x1 . x„).  We  restrict  ourselves  to  the  problem  of 

detecting  large  extreme  values  because  of  the  nature  of  the  applications  we  wish  to 
make.  It  is  easy  to  see  that  if  we  sample  simultaneously  from  two  populations,  one  of 
which  is  stochastically  larger  than  the  other,  we  can  expect  extreme  values  to  occur 


-  vo'-w.y,  feat*** 


more  frequently  than  if  we  sample  from  only  one  population.  Therefore,  we  formulate 
our  hypotheses  as  follows: 

Hq:  (no  signal),  x. i  =  1 , ...,  n  has  distribution  F(x),  where  F(x)  =  1  -  F(-je);  that 
is,  the  distribution  of  x  is  symmetric  about  zero. 

/f  j (one  or  more  signals).  Some  proportion  p  of  the  sample  has  distribution  G(x) 
where  G(x)  =  F(x-a),a  >  0,  and  the  rest  of  the  quantities  x  come  from  the  population 
whose  distribution  is  F(x). 

This  means  that  we  are  sampling  either  from  one  population  or  from  two  popula¬ 
tions,  one  of  which  is  stochastically  larger  than  the  other;  that  is,  G(x)  is  stochastically 
larger  than  F(x);  (F(x)  >  G(x)). 

With  the  above  formulation  of  the  hypotheses  we  wish  to  prove  the  following  lemmas. 

LEMMA  I.  Given  a  random  sample  of  size  n,n  =  2m,  where  each xj ,  i  =  1,  ...,n  has  the 
distribution  F(x)  of  HQ,  then  g|pj , ...,  y^\  =  and  the  distribution  of 

-y\  •  •••>  -ym  is  the  »“*  ->-v  7  ' 

Proof:  g(yV:.,y2myn\(f(ylSj  ...  f(y2mj  =«!  f(*,)  -f^  -g 
The  second  part  follows. 

LEMMA  II.  Under  HQ, 

/ 

^  (l/2)*+<,  where  . yn  are  the  order  statistics  and 

*»0 

as  in  Lemma  I,  n  *  2m, i  <  m,  j  <  m-1,  and^  <  0. 


ph>'*i -jTwgwii  r  i?  ifwim  ifM  - 

Using  integration  by  parts,  we  obtain 

(Vwio-DirilH#1'1  [iM4"'Htrf;Tll(y'H4"-H 

|pWT  '(>»-/)  |  f(’vK  I 

Applying  integration  by  parts  to  the  inner  integral  repeatedly  yields 


(£).r  1  •••<*♦»  [f(^)J‘*m  |>-4, 

*=o 


n-i-k 
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Let  z  -  1  -  2F  yy.j  •  Then  we  have 

”  V  (frw-1)?  /  «-l  \  fi  ( l-*\*  '‘7  _«-/-*  d£  _  V  /*+M  \  n 

0-1)!  Z  At!  (jfc+j-1/  Jo  (  2  /  2  Z  \  A  /(  '  ' 

*=0  *=0 

which  was  to  be  shown. 

It  should  be  noted  the  probability  p|-y.  >yn_j  j  is  distribution-free  with  respect  to 
the  class  of  distributions  which  are  symmetric  about  zero.  It  is  not  distribution-free,  how¬ 
ever,  with  respect  to  the  class  of  distributions  symmetric  about  a  point  other  than  zero. 


LEMMA  III.  Using  the  same  hypothesis  as  in  Lemma  II,  except  that  i  <  m,  we  have 

PL  <„  <  i  _ "!  r°  r->i 

\  '*>  "-/  'I  L  Jo  J-V; 

(fwim  "'h'2  i -wi/fWfW  iy-i) 


Proof: 


■  wWi  >i/-  P  (FWIM 

4i'h' 


Using  integration  by  parts  gives  us  the  result 

Wh*  >«[‘-2FWi"'w+ 1/2  r  k^)]m 


2F(,,-/)-ii  [i'F(’,-/)i''1  1 7')^ 


Repeated  use  of  integration  by  parts  produces  the  result 

i  (w-'h^-2)  Ifw|“h 

*= 0 

(l/2y*+l  (l-2F(^)J f^dv, 

We  let  z  =  l-2F^.j;  using  a  method  described  in  reference  1*  yields 

y  d/2)/^1  =  (^)(i/2f/» 

<R> 

which  was  to  be  shown. 

It  should  again  be  noted  that  the  probability  p[->».+  j  <yn/  <  -yi  J  is  distribution- 
free  with  respect  to  the  class  of  distributions  symmetric  about  zero  and  no  other  point. 
These  lemmas  are  used  in  DISTRIBUTION  OF  S. 

*See  REFERENCES. 
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[f  we  let  A  be  the  event  -y.  >yn_j  >  j ;  and  let  B  be  the  event  -^f+1  >yn_j\  then 

A^jB  is  the  event  -yj  >y  ..  Since  A  and  B  are  mutually  exclusive,  ?[AKJB\  =  P[/t] 

+  P[f?]  or 

£  (**M)  (1/2)^'  =  £  ) ( 1  /2)*+ * +  (/+/)  (l/2)'+/+1 

k=0  k= 0 


The  above  is  shown  analytically. 

i  (T)<i/2,‘“ = i  (‘t’)(i/2,t,it  s  o <iw 

*=0  *=0  *=1 

We  let  Ar-1  =>>  in  the  second  summation  on  the  right;  then  we  obtain 

/  /  l-l 


2  (T)0™**1*  2  (*?')  2  (7)<I/2V 

i-eft  v  =  fl 


-  £  (^■1)  (l/2)*+1  +  £  (^')(l/2r+2-('^(l/2)/+2 

*»0  y»0 

Rearranging  terms  yields 

1/2  E  (*>/2)‘+1  =  ^  (^’1)(1/2)*+1-(7)(1/2)/+2 

*-0  **0 


We  multiply  both  sides  of  the  last  equation  by  (l/2)/_1  to  obtain  the  desired  result. 


THE  STATISTIC  S 


Lemma  II  of  reference  2  strongly  influences  the  definition  of  the  statistic  S  which 
we  shall  shortly  present.  To  this  end,  then,  we  consider  an  n -dimensional  random  sample 
of  size  one,  x  j , ... ,  rB ,  where  each  x(  has  the  distribution  F(x)  as  stated  in  /fQ  and  where 
n  =  2m.  We  form  the  order  statistics .y  j  ,y2, ...  ,y  ,  and  change  the  algebraic  signs  of  the 
first  m  order  statistics  to  get 

~yV~yV  •"  m'^m+V '^n' 

Let  us  write  the  sample  as  Wj , ... ,  wm  ,ym+i , ...  ,yn.  If  we  now  order  the  quantities  w 
and  the  quantities  y,  we  get  an  arrangement  which  may  look  as  follows: 

?m*l  <  wm  <  wm-l  <^w+2  < 
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This  is  just  one  of  the  — —  possible  arrangements.  We  now  define  S  as 
m\m\ 


s-  2  *(,,) 


l-W  +  1 


where  is  the  rank  ofyf  in  the  ordered  sample  of  w  and.y,  It  is  easily  seen  that 

<S< .  xhe  second  sum  occurs  when  all  the  w  are  less  than  all  the_y, 


and  th  .irst  sum  occurs  when  the  situation  is  just  reversed.  If  is  true,  we  should  expect 
the  w  to  be  randomly  placed  between  the  y.  If  the  w  and  y  actually  alternate, 

S  =  m(m+ 1)  or  m2,  depending  on  whether  we  start  with  a  w  or  a  y.  Because  of  // j ,  we 
should  expect  S  to  be  larger  under  than  under  HQ.  For  this  reason  the  decision  rule 
is  as  follows: 

Reject  Hq  if  S  >  X  where  X  is  an  integer  so  chosen  as  to  yield  a  false-alarm  probability 
which  is  less  than  or  equal  to  a. 


DISTRIBUTION  OF  5 


By  definition,  S  is  the  sum  of  ranks  -  that  is,  of  positive  integers  and  the  same 
value  of  5  can  be  obtained  by  adding  together  different  integers. 

The  definition  of  S,  therefore,  demands  that  we  count  the  number  of  partitions  of 

the  integer  8  which  belong  to  tne  range  of  S\  -.1.^  <  8  <  ^  restric- 

2  2 

tion  that  the  number  of  components  in  each  partition  is  m.  The  generating  function  for 
enumerating  partitions  with  unequal  parts  by  number  of  parts  is  (page  1 1 3,  reference  3): 

«• 

G(f,a)  =  ]>  u(U)fl* 

*=0 

where  u (t,k)  is  the  generating  function  for  partitions  with  k  unequal  parts.  Therefore,  let 
us  define  Gb  n(t,a)  as: 

n-b*  1 

GA,„(f.«)=  ^  u(r,*)a* 

*= 0 

G b  „(f.a)  is  the  function  for  enumerating  partitions  of  an  integer  q  with  unequal  parts  by 
number  of  parts  where  the  smallest  part  is  greater  than  or  equal  to  b  and  the  largest  part 
is  less  than  or  equal  to  n.  The  function  u(r,k)  is  the  generating  function  for  partitions 
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with  k  unequal  parts  with  the  restrictions  mentioned  for  Gb  n(t,a).  This  leads  us  to  the 
following  theorem: 


THEOREM  I.  If  in  Gb  n(t,a)  we  let  k  =  m,  we  obtain 

Ifm+b- 1 

u(r,m)  =  i - L 

Proof:  Gb  n(t,at)  =  (l-taf6+1)...  (lW+1)  and  (lW+1J  Gb  n(t,a ) 

*  (l***)  G mM),  so  that 
n-A+1  n-b+1  n-b+1 

^  u(r,m)am  +  ^  u(r,m)flw+V+1  =  ^  ^  u(r,w)am+1r+i 

m^O  m=0  m»0  m= 0 

Making  a  change  of  the  variable  of  addition  in  the  second  and  last  summations  and  equating 
coefficients  of  am  results  in 

u(t,m)[l-tm]  =  u(t,m-l)|fw+*-1-r',+1]  (1) 

or 

L/n+ft-l.ri+l) 

u(r,m)  =  u(r,m-l)  - — - - - —  (2) 

(,"fW) 

With  u(/,0)  =  1  and  repeated  iterations  of  (2),  the  desired  result  is  produced.  Definition: 
Let  N (b,  m,  n,  r)  be  equal  to  the  coefficient  of  tr  in  the  expansion  of  u(r,m)  in  powers  of  t. 
N (b,  m,  n,  r)  is  the  number  of  partitions  of  the  integer  r  into  m  distinct  parts  such  that  all 
parts  are  greater  than  or  equal  to  b  and  less  than  or  equal  to  n.  It  should  be  noted  that  N  is 
actually  a  function  of  only  three  distinct  variables  -  b,  m,  and  r  -  since  n  =  2m.  In  order 
to  find  the  distribution  of  S,  we  must  know  the  probability  of  each  of  the  possible 
arrangements  of  the  combined  orderings  of  the  w  and  y.  If  we  order  the  combined  sam¬ 
ples  of  w  and  y,  the  result  is  that  we  obtain  either 

a.  A  run  of  w,  then  a  run  of/,  then  a  run  of  w  again,  etc.,  ending  either  in  a  w  or  ay; 


u(r,m)  =  u(r,m-l) 


b.  A  run  of  y,  then  a  run  of  w,  then  a  run  of y  again,  etc.,  ending  either  in  a  tv  or  ay. 
The  following  theorem  produces  the  probability  of  a  particular  arrangement  of  tv  and/., 

THEOREM  II.  (1)  Under  UQ 

P jeondition  (b)  above  J  *  ( 3^’1)  P  J/k  <  wm  <yJt+1j=  (l/2)3m_* 

/i-rank  (»•’  >*1 

=  (1/2)  m 

where  yk ,  m+ 1  <  k  <  n  is  the  last  y  in  the  beginning  run  of  y  and  the  rank  (wm )  is  the 
rank  of  ( wm )  in  the  combined  sample  of  w  and/. 
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(2)  Under  HQ 

PJcondition  (a)  above]  =  (  ^ j  P | <>’m+]  <  tv^.,  ] 

=  <mrk-'  =  0/2^ 'w+' 

where  wk ,  1  <  k  <  m  is  the  last  of  the  w  in  the  beginning  run  of  w  and  rank  (ym+])  is 
the  rank  of>'m+1in  the  combined  sample  of  w  and_y.  ' 

Proof:  The  proof  is  given  for  the  case  where  n  =  6  and  m  =  3.  We  then  have 

yv  yi’  yy  y4,ys,y6 

~y  y~y  2’  ~y3,y4’  yS’  y6 

Wj,  w2,  w3,  y$’  y$- y^ 

The  number  of  ordered  arrangements  of  the  w  and>»  is  20,  and  they  are: 

w3<w2<  w,  <y4  <ys  <y6 
w3<tv2<^4<w,  <ys<y6 
w3<tv2<>'4<^5<tv,  <y6 

w3<w2<y4<y  s<yf><*] 

w3<yA<^2<wi  <ys<y6 
w3<y4<w2<ys<H'i  <y6 

w3<y4<w2<y5<y6<*, 

w3<y4<yS<W2<Wl  <y6 
w3<y4<y5<W2<y6<Wl 
*,3<y4<ys<y6<W2<Wl 

Now  interchanging  tv3  with  y4 ,  h>2  with  y 5 ,  and  w,  with  y6  yields  the  other  1 0  ordered 
arrangements.  We  now  consider  arrangements  which  satisfy  condition  (a,>  and  for  an 
example  consider 

w3<y4<w2<  wl<y5<y6 
w3<y4<W2<yS<Wl  <y6 
w3  <y4  <  W2  <y5  <yb  ^  W1 
w3<y4<y$<W2<Wl  <y6 
w3<y4<yS<W2<y6<Wl 
w3<y4<y5<y6<  ,;2<K,1 

Since  these  arrangements  exhaust  the  ways  in  which  y4  can  he  between  w3  and  w2 ,  and 
since  each  arrangement  is  mutually  exclusive  of  the  others,  it  is  clear  that 

p|tv3  <y4  <w2  <w,  <>-5  <^6j  +...+  P[w3  <y4<y5  <yb  <w2  <  w, 

=  p|w3  <y4  <  w2  j  =  ^ ~  )  (l/2)2+2+1  from  Lemma  III. 

9 


Next  we  wish  to  show  that  the  above  arrangements  are  equiprobable, 
p[H’a  <>4  <ps  < “a  < wi|  •  4^3  <>'4  K>s  < -yr  <yf < ^1] 

= «i°  r  4.  r  ix  '(’■ i 

= ®  r  c  Ct  c  iis  k*  i 

since  F(y)  =  l-F(-y), 

=  b!  1°  C  C6  P 1/2 

=  6!  l°fo%  1/4  ^  '“r  ■"” 

m&jlP  lPFp—^)iyh^ 


ro  ,  [l-2F(y,)]s 

6!  £  1/16  — si -  fi)^! 


At  JL  1  *  1 
32  6!  32 


p[w3<^4<w2<vj<^<w1J  *P|  ,y3  <>>4<*3'2<^  <>,6<^l| 

XXrUX 


Tj  <p<  <  *2  <  *1  <pJ  <p6 1  '  P  [  ■J’j  <  >4  <  *1  <  <  /j  <  -A  | 

■®x  a  4, 


It  is  easily  shown  that  the  other  three  probabilities  all  are  equal  to Therefore,  we 
obtain  the  general  result 

pjcondition  (a)  above]  =  (^j2)  pJ^  <ym+1  <  h>m  ]  =  (l/2)m+*'1 
The  example  immediately  gives  us  the  result 
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% 


In  a  similar  manner  we  show  that 


P  condition  (b)  above  j  =  p|y*  <  wm  <^+ 1 


=  ( l/2)3m-k 


n-rank(w)  +  1 

=  (1/2) 


Corollary  I 

r[ym  <  -  <  <^1  <  •••  <>’»] = <  -  <y»  <ym<  •••<  ] = w2> m 

The  proof  can  also  be  easily  obtained  with  Lemma  II.  Tire  distribution  of  S  is 
obtained  with  the  next  theorem: 


THEOREM  III.  Under  HQ 

P[s=r]  =  P[s=r  |  condition  (a)  is  true] 

+  P(s=n  |  condition  (b)is  true] 

=  (N(l,m,  n,r )  -  N(2, m,»,r)]  (l^)3"1'1 

m+ 1 

+  ^  (N(h,m,n,r)  -  N(i+1, m,n,r)]  (l/2)m+*~2 
b=  2 

where  N(m+2,m,n,r)  =  0 

Proof:  The  proof  of  the  theorem  follows  from  the  definition  of  N(ft,m,n,r)  and 
Theorem  II. 


Definition:  X  is  the  critical  integer;  that  is,  P[S>  X]  <  a 

If  a  =  Iff6,  we  need  to  consider  a  sample  of  dimension  greater  than  or  equal  to  40  to 
obtain  the  desired  critical  region.  The  distribution  of  S  for  n  *  2, 4, 6, 8  appears  in 
table  1.  The  mean  and  variance  of  S  are  obtained  in  the  next  two  theorems. 


THEOREM  IV 


•  i  m-1  / 


t  (T)(. nr 


Proof:  Let  U 


i.  n-j 


/=  1  /“  0  *=0 

,if  V- 

..  Then  >  U.  .  equals  the  number  of  times 

/  if  n*/ 

0  otherwise  yy' 


the  negative y  (that  is,  the  w)  exceed  the  positive y,  so  that  S  =  ..  ^  U.  .  ..  and 


,_m(3m+l)  ST  n 

“  "  Z  n/ 1 

U 
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inOW— IK*» 


m  .**3.  I  p \y,>y.., 


I./ 

m  m- 1  / 


-aaru- HI 


i=l  /=0  *=0 


THEOREM  V 


m  m-1  / 


v[s|=  III  (?)«*”- 


1=1  /=0  *=0 

m  m-1  *-l  / 


m  m-1  / 

1 2 1 

/=1  /=0  *=0 


’*+1-1 

* 


l)(l/2^ 


-2222  ("?') (,,2r' 


1=1  *=2  /*0  9=0 
m-1  m-1  *-l  / 


-in  ir>^‘ 

/=0  *=3  /“I  9-0 


m  M  m-1  *-l  / 


-22  222  0**' 


8-3  8=1  *-2  /-0  9-0 

8-1  m-1  *-l  r  ,  8-1 W 


+  2 


m  X-l  m-1  *-l  f  ,  H-l-i  .  .  K  . 

22  2  2(2r>H(2(,>'H 

8-3  i=l  *-2  /-0l  n-0  9-0 


*  2  (’T) (mri 


9*0 

Proof:  Since  S  is  of  the  form  b  +  w,  b  a  constant  and  w  a  random  variable,  and 
since  V(b  +  w)  ®  V[w] ,  it  is  easily  seen  that 

V[il  =v|^u,,,|.e(I u,„.,)  - (2 e(u, 


U 


U 


u 


■  Ie(uw  I*  2  eK-/M 

i,l  i+k  ,  /-*  , 

j#  m  m  -  /-« 


m  m-1  / 


2  2  2  (T)  <>«>**' 

8=1  /=0  *=0 
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But,  E[U2]  =  E[U]  so 

m  m- 1  /  i  m  m- 1  / 

vlsl= 2 2 1  rt')(i,2H2  2  2  (^i1) 

»=1  ;=0  *=0  1  i=l  /=  0  *=0 

♦  2  e|va.*-«] 

If  r=fi,  and  n-j  f  n-k,  then 
E [U/,  n-j  Uf,  n-k  1  =  P[*/  >yn.j  -yj  >yn-k  ] =  P|^  >^]when *  =  max  \n-j,  «-*J. 

If  n-j  =  n-k  and  i  £,  then 

E[u/,  n-j  uc,  n-j  ] =v\-yb  >yn.\  where  b = max  ['-  4 

If  /  >  fi  and  n-j  >  n-k,  then 

E(v/uu-.|-ph>>4 

If  i  <  £  and  n-j  >  n-k,  then 

E  [u/,  n-j  u£.  n-k  }a?[y  <yn-j  <  -yt  ]  ?[y  >yn-k  ]+?\-y  >  V/ j- 

Substituting  the  probabilities  into  the  formula  produces  the  desired  result. 


TWO-SAMPLE  COMPARISON 

Since  the  purpose  of  ordering  the  n-dimensional  sample,  then  changing  the  signs 
of  the  first  m  (n  =  2m)  values,  was  to  compare  half  of  the  sample  (the  upper  half)  with  the 
remaining  half  (the  lower  hall),  it  is  therefore  appropriate  to  ask  how  the  foregoing 
development  would  appear  if  we  actually  used  two  independent  m-dimensional  samples. 
We  therefore  wish  to  prove  the  following  lemmas. 


THEOREM  VI.  Given  x(1),  ...,x^  and  two  independent  m -dimensional 

samples  of  order  statistics  from  a  cdf  F(x)  when  F(x)  need  only  be  continuous,  then 


P'*(m-0  <y(m-j)  <x(m-i+ 1) 


■  or  "w 


™<x) 


i<m;  j<m 


Proof:  With  the  use  of 
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wfcre  n- 1  observations  have  cdf  F  and  one  observation  has  cdfG,  rg  is  the  rank  of  the 
observation  having  cdf  G  in  the  combined  sample;  the  theorem  can  easily  be  proved. 


THEOREM  VII,  Given  the  hypothesis  stated  in  Theorem  VI,  then 


m 


m 


k=m-i 


where  i<m  and /  < m. 


Proof: 


Pfx(m-/+1)  >'y(/«-/)]  =  ?[y m  j<x(m-i+ 1)  *x(m-/+l) 

=  *(m-/+l)|  dP(*(m-/+l)-x(m-/+l)] 

=  X  dP[x(m-/-*-l)^JC(m-»+l)J 

■  2  r(r)Kw.,)H'-f(w,  Jh 

*=m-/ 

kmm-j 

Letu  =*^F  ■*(„_(+ i)j  ;  then  we  have 
m 

•  £  (^*)  /** n*+m-,(l^)w**+,-Id« 

k*m~j 

.„„i  _gs> . 

L-J 

which  was  to  be  shown. 

THEOREM  VIII.  Given  the  hypothesis  stated  in  Theorem  VI,  then 

/ 

«“  4w„  »«„./) l  *  2  (T) wr> 

m+o »  *-o 
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} 


m 

Proof;  lim  p[x(m.w)  >yw.fl  ]  =  lim  J 


QO 


Let  g  =  k-m+j',  then  we  have 


-  i  cap 


U*h) <twl) 


,  which,  through  a  method  described  in 


/ 

reference  4,  results  in  ^  f^i)  Now,  letting ;-l  =  w,  we  obtain 

/-  0 

°  / 

^  ^  (l/2)w+1  =  ^  ^W+I  (l/2)w+l.  However,  by  Lemma  II  this  is 

w“/  w=*0 

»«-/]• 


THEOREM  IX.  Given  the  hypothesis  stated  in  Theorem  VI,  then 
lim  Pj*(m_0  <-P(m-/)  <x(m-/+l)  ]  =  (  /)  W2^+1 


(m-i)  (m)(m) 

- — —  =  )  (l/2)'+/+l  which  is  obtained  by  using 

.  ... /2ffi\  V  i  / 

W 


Proof:  lim 


reference  4.  However,  by  Lemma  III  this  is  P|-yj+1  <ym.j  <  -yt J.  As  demonstrated,  the 
distribution  of  S  depends  directly  on  Lemma  III  and  Lemma  II,  so  that  for  large  values  of 
m,  the  same  values  for  p|-y/+(  <yn.j  <  -7,1  and  pj-^  >yn.j  j  are  obtained  in  the  one-  and 
two-sample  case,  and,  hence,  the  procedure Jof  using  S  leads  to  the  same  result.  It  is  known 
that  if  we  have  two  independent  samples,  the  statistic  S  becomes  the  Mann-Whitney- 
Wilcoxon  statistic,  which,  as  is  well  known,  is  asymptotically  normally  distributed. 
Because  of  the  great  difficulty  of  obtaining  detection  probabilities,  they  are  not  included  in 
this  report. 


APPLICATIONS 

The  distributions  considered  in  hypotheses  Hq  and/Zj  satisfy  Lemma  1,  reference 
2,  and  because  of  Lemma  II,  reference  2,  a  one-sided  test  was  considered.  If  we  are  inter¬ 
ested  in  detecting  a  single  enemy  object,  such  as  an  airplane,  using  a  radar  device,  we  can 
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do  better  using  the  statistics  considered  in  reference  2  than  the  one  considered  here, 
because  the  statistic  S  is  relatively  insensitive  to  one  extreme  value  in  that  it  does  not  con¬ 
sider  its  magnitude.  The  statistic  S  is  sensitive  to  enemy  objects  in  the  plural  -  to  a  fleet 
of  gun  boats,  for  example,  or  a  number  of  dense  air  targets  or  hostile  troops.  It  would  serve 
a  valuable  function  therefore  in  detecting  a  concentration  of  the  enemy. 

A  second  application  is  that  of  intercept  of  frequency-hopped  signals  (radar 
or  communications). 

Suppose  that  the  message  or  radar  transmission  consists  of  a  number  of  frequency- 
hopped  pulses. 


The  detection  system  illustrated  uses  a  bank  of  narrowband  filters,  each  of  bandwidth  W. 
The  output  of  each  filter  is  energy-integrated  over  time  T,  the  message  length  or  radar 
transmission  length.  (This  could  be  continuous  integration  of  a  moving-window  type  or 
some  form  of  integrate-and-dump  -  the  false-alarm  rate  depends  on  the  decision  rate.)  In 
the  presence  of  a  frequency-hopped  signal,  a  substantial  portion  of  the  frequency  cells  are 
occupied  at  various  times  during  the  signal  time  T. 

The  distribution  of  the  integrated  energy  Vt(t)  is  given  by  the  chi-square  distribu¬ 
tion  with  2  TW  degrees  in  the  noise  case  (reference  5).  TW  will  be  very  large,  and  Urkowitz 
shows  that  for  large  values  of  TW  the  normal  distribution  is  a  very  good  approximation  to 
the  distribution  of  K(f).  Since  the  quantities  V((t)  have  been  normalized  by  the  noise 
power  density  for  the  respective  filters,  all  xf  will  have  approximately  the  same  variance. 
Thus,  the  xi  are  identically  and  symmetrically  distributed  about  zero  in  the  noise  case.  In 
the  presence  of  certain  types  of  interference,  this  will  not  always  be  true.  However,  the 
statistical  test  described  tends  to  alarm  only  when  a  substantial  number  of  resolution  cells 
are  occupied  (even  at  low  signal-to-noise  ratios),  and  its  false-alarm  rate  will  not  increase 
appreciably  when  only  one  or  a  few  cells  have  interference  signals,  even  very  strong  signals 
(fig.  1 ).  Thus,  the  false-alarm  rate  will  tend  to  be  constant  even  in  the  presence  of  noise 
plus  interference. 


TEST  STATISTIC 


CONCLUSIONS  AND  RECOMMENDATIONS 


It  has  been  shown  that  the  statistic  S  has  the  property  that  any  value  of  a  can  be 
closely  obtained  provided  the  dimension  of  the  sample  is  large  enough  and  the  underlying 
distribution  F(x)  has  the  property  that  F(x)  =  1  -  F(-x).  Also,  if  we  consider  two  indepen¬ 
dent  samples  from  a  continuous  distribution  F(x)  and  the  dimension  of  both  samples  is 
large,  then  the  procedure  using  S  leads  to  the  same  result  in  both  cases;  that  is,  the  case  in 
which  F(x)  =  1  -  F(-x)  and  one  sample  is  used  and  the  case  in  which  F(x)  is  continuous  and 
two  samples  are  used.  The  test  procedure  using  S  seems  to  be  poor  in  one  radar  application 
(against  single  targets)  but  relatively  good  in  another  (against  multiple  targets).  It  was  also 
indicated  that  S  can  be  used  to  intercept  frequency-hopped  signals. 

It  is  recommended  that  before  this  new  detector  is  implemented  some  detection 
probability  results  be  obtained  either  by  analysis  or  by  simulation  by  Monte  Carlo  methods, 
for  example. 
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